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SEQUENCE LISTING 

SEQ ID NO 1: 

cDNA nucleotide sequnce of the GCB gene. 
Underlined sequence: signal peptide 1-57 
Mature peptide: 58-1548 

1 atggc tqqcagcctc acagqtttqc ttctacttca ggcagtgtcg tqqqcatcaq 

56 £tgcccgccc ctgcatccct aaaagcttcg gctacagctc ggtggtgtgt 

106 gtctgcaatg ccacatactg tgactccttt gaccccccga cctttcctgc 

156 ccttggtacc ttcagccgct atgagagtac acgcagtggg cgacggatgg 

206 agctgagtat ggggcccatc caggctaatc acacgggcac aggcctgcta 

256 ctgaccctgc agccagaaca gaagttccag aaagtgaagg gatttggagg 

306 ggccatgaca gatgctgctg ctctcaacat ccttgccctg tcaccccctg 

356 cccaaaattt gctacttaaa tcgtacttct ctgaagaagg aatcggatat 

406 aacatcatcc gggtacccat ggccagctgt gacttctcca tccgcaccta 

456 cacctatgca gacacccctg atgatttcca gttgcacaac ttcagcctcc 

506 cagaggaaga taccaagctc aagatacccc tgattcaccg agccctgcag 

556 ttggcccagc gtcccgtttc actccttgcc agcccctgga catcacccac 

606 ttggctcaag accaatggag cggtgaatgg gaaggggtca ctcaagggac 

656 agcccggaga catctaccac cagacctggg ccagatactt tgtgaagttc 

706 ctggatgcct atgctgagca caagttacag ttctgggcag tgacagctga 

756 aaatgagcct tctgctgggc tgttgagtgg ataccccttc cagtgcctgg 

806 gcttcacccc tgaacatcag cgagacttca ttgcccgtga cctaggtcct 

856 accctcgcca acagtactca ccacaatgtc cgcctactca tgctggatga 

906 ccaacgcttg ctgctgcccc actgggcaaa ggtggtactg acagacccag 

956 aagcagctaa atatgttcat ggcattgctg tacattggta cctggacttt 

1006 ctggctccag ccaaagccac cctaggggag acacaccgcc tgttccccaa 

1056 caccatgctc tttgcctcag aggcctgtgt gggctccaag ttctgggagc 

1106 agagtgtgcg gctaggctcc tgggatcgag ggatgcagta cagccacagc 

1156 atcatcacga acctcctgta ccatgtggtc ggctggaccg actggaacct 

1206 tgccctgaac cccgaaggag gacccaattg ggtgcgtaac tttgtcgaca 

1256 gtcccatcat tgtagacatc accaaggaca cgttttacaa acagcccatg 

1306 ttctaccacc ttggccactt cagcaagttc attcctgagg gctcccagag 

1356 agtggggctg gttgccagtc agaagaacga cctggacgca gtggcactga 

1406 tgcatcccga tggctctgct gttgtggtcg tgctaaaccg ctcctctaag 

1456 gatgtgcctc ttaccatcaa ggatcctgct gtgggcttcc tggagacaat 

1506 ctcacctggc tactccattc acacctacct gtggcatcgc cagtga 
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SEQ ID NO 2: Amino acid sequence codifing human GCB and 
native signal peptide. 
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Lys 


Gly 
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His 
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Trp 
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Asp 


Ala 
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Ala 
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Gly 


Phe 
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Pro 
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Phe 


He 


Ala 


Arg 


Asp 


Leu 


Gly 


Pro 


Thr 


Leu 


Ala 


Asn 


Ser 


Thr 


His 


His 


Asn 


Val 


Arg 


Leu 


Leu 


Met 


Leu 


Asp 


Asp 


Gin 


Arg 


Leu 


Leu 


Leu 
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His 
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Ala 
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Val 
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Thr 


Asp 


Pro 


Glu 


Ala 


Ala 
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He 


Ala 


Val 
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Ala 
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Ala 
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Trp 
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He 
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Gly 
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Thr 
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Gly 


Gly 


Pro 
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Trp 


Val 


Arg 
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Phe 


Val 
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He 


He 


Val 


Asp 
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Asp 
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Phe 


Tyr 


Lys 


Gin 


Pro 


Met 


Phe 


Tyr 


His 


Leu 


Gly 


His 


Phe 
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Lys 


Phe 


He 
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Glu 


Gly 


Ser 


Gin 


Arg 


Val 


Gly 


Leu 


Val 


Ala 


Ser 


Gin 


Lys 


Asn 


Asp 


Leu 


Asp 


Ala 


Val 


Ala 


Leu 


Met 


His 
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Asp 
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Ala 


Val 


Val 


Val 


Val 
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SEQ ID NO 3: forward primer for GCB amplification 
5' : tctagaatggctggcagcctcacaggt 

SEQ ID NO 4: reverse primer for GCB amplification 

5' : gtgtggatggacaccgtagcggtcactctcgag 

SEQ ID NO 5: forward primer for GCB amplification 
5' : cccgggtgcccgcccctgcatccctaaaagc 

SEQ ID NO 6: PGLOB promoter 

1 taaaataatc tatacattaa aaaatttgat tttaaaattt tagaaattca tgattttatt 
61 tttttttacc agaaatccgt taatattgtt aaaatattac caactaattt ataaatttta 
121 ttttaaggca attaagcatg tttgataaaa tatatatatt gttataaata cttttcaaaa 
181 gtataaagtt gatgatggcg tggtggtaga ttattttagt tctaggttcg aatgcaagtt 
241 ggtttagaca tttagcctta ttcttttttc taaccaaaat aaatgtaaat ggaaaacctt 
301 taggaaaaaa aagaaatcaa aattgaaaac atcatccggt ggagtcgaga agcccacacc 
361 cacgtgaccc aacaatatta aaataagagt ttgctctaca gtaaatgcga tactttttta 
421 ttcaatactt tttccacttc taaaatcttg gagatttgca ccgttaacta attaagtgtt 
4 81 atatccaacg gtcctaaaaa aacttgtgta ccgtgcctca catttcaact ttgcgcaccc 
541 tagaagccgt ctatgtttag gttagtgttt gcaacagttg aagcgcatca ctcaggaggc 
601 tacttggtct tgcttttgcg tcttttgttc aatttttcac gtgattttgt tggtgaacac 
661 gcgtacttga aacttattat aaattacata attttataag tttcacttct tatataatac 
721 ttcattcatg catttataat tttgatgaat aataaagagt ttgttaaaaa atatattatt 
781 tcatataata tatagggttt agaatgccaa tttttaaaaa aagaataaaa aaataaatag 
84 1 aataaaatcg aaaaaatgaa atgtaaaaaa tttgaggggg acaaataaaa tatgaaagtc 
901 tattatttaa attttccatt agaattctat tttccttagt taatatgagc tagccagttg 
961 ggagatacac gaaaatgtca tgaaacagtt gcatgtaggg aaattaatgt agtagaggga 
1021 tagcaagaca aaaatccaag ccaagctagc tgctcacgcg aactcgatcc acacgtcctt 
1081 tacagagttt caaacggatg aaatctgcat ggcatgcaac taaagcattg ttctcagctg 
1141 ccaagtaccc ctcacactca ccaacccttt gtttttctcc ccattgcatg ttaactcaag 
1201 tttatccttt ctttgcttct ggaaatttca caagcctcaa acacgtcgac gtccaatctt 
1261 gtgaccaaca cggccaaaag aaaagagaat ctcatcccgt tcacacttag ccacttaaag 
1321 ctagccaaac ggtgatcttt ctctatatat tgtagctctc taacacaacc aacactacca 
1381 ttattcaata ttcaaacctt gctctatact acacacacta gaagaata 
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SEQ ID NO 7: Soy basic glubulin 7S signal sequence 
1 atggcttctat cctccactac tttttagccc tctctctttc ttgctctttt cttttcttct 
61 tatccgactc a 

SEQ ID NO 8: cDNA nucleotide sequence of the GLA gene 
Underlined sequence: signal peptide 21-116 
Sequece coding for the mature peptide 117-1310 
1 aatgctgtcc ggtcaccgtg acaatqcaqc tqaqqaaccc aqaactacat ctqqqctqcq 
61 cqcttqcqct tcqcttcctq gccctcqttt cctqggacat ccctqqqqct aqaqca ctgq 
121 acaatggatt ggcaaggacg cctaccatgg gctggctgca ctgggagcgc ttcatgtgca 
181 accttgactg ccaggaagag ccagattcct gcatcagtga gaagctcttc atggagatgg 
241 cagagctcat ggtctcagaa ggctggaagg atgcaggtta tgagtacctc tgcattgatg 
301 actgttggat ggctccccaa agagattcag aaggcagact tcaggcagac cctcagcgct 
361 ttcctcatgg gattcgccag ctagctaatt atgttcacag caaaggactg aagctaggga 
421 tttatgcaga tgttggaaat aaaacctgcg caggcttccc tgggagtttt ggatactacg 
4 81 acattgatgc ccagaccttt gctgactggg gagtagatct gctaaaattt gatggttgtt 
541 actgtgacag tttggaaaat ttggcagatg gttataagca catgtccttg gccctgaata 
601 ggactggcag aagcattgtg tactcctgtg agtggcctct ttatatgtgg ccctttcaaa 
661 agcccaatta tacagaaatc cgacagtact gcaatcactg gcgaaatttt gctgacattg 
721 atgattcctg gaaaagtata aagagtatct tggactggac atcttttaac caggagagaa 
781 ttgttgatgt tgctggacca gggggttgga atgacccaga tatgttagtg attggcaact 
841 ttggcctcag ctggaatcag caagtaactc agatggccct ctgggctatc atggctgctc 
901 ctttattcat gtctaatgac ctccgacaca tcagccctca agccaaagct ctccttcagg 
961 ataaggacgt aattgccatc aatcaggacc ccttgggcaa gcaagggtac cagcttagac 
1021 agggagacaa ctttgaagtg tgggaacgac ctctctcagg cttagcctgg gctgtagcta 
1081 tgataaaccg gcaggagatt ggtggacctc gctcttatac catcgcagtt gcttccctgg 
1141 gtaaaggagt ggcctgtaat cctgcctgct tcatcacaca gctcctccct gtgaaaagga 
1201 agctagggtt ctatgaatgg acttcaaggt taagaagtca cataaatccc acaggcactg 
1261 ttttgcttca gctagaaaat acaatgcaga tgtcattaaa agacttactt taaaatgtt 

SEQ ID NO 9: Amino acid sequence coding for GLA and native 
signal peptide 

Thr Met Gin Leu Arg Asn Pro Glu Leu His Leu Gly Cys Ala Leu Ala 
Leu Arg Phe Leu Ala Leu Val Ser Trp Asp lie Pro Gly Ala Arg Ala 
Leu Asp Asn Gly Leu Ala Arg Thr Pro Thr Met Gly Trp Leu His Trp 
Glu Arg Phe Met Cys Asn Leu Asp Cys Gin Glu Glu Pro Asp Ser Cys 
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Lys 


Ala 


Leu 


Leu 


Gin 


Asp 
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Val 
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Gly 
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Arg 
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Leu 







SEQ ID NO 10: Forward primer for GLA amplification 
5' : ggatccctggacaatggattggcaaggac 

SEQ ID NO 11: Reverse primer for GLA amplification 
5' : gtctacagtaattttctgaatgaaattctatag 
SEQ ID NO 12: cDNA GAA. 

Underlined sequence: signal peptide 220-426 
Sequence coding for the mature peptide 427-3075 



1 cagttgggaa agctgaggtt gtcgccgggg ccgcgggtgg aggtegggga tgaggcagca 
61 ggtaggacag tgacctcggt gaegegaagg accccggcca cctctaggtt ctcctcgtcc 
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121 gcccgttgtt cagcgaggga ggctctgggc ctgccgcagc tgacggggaa actgaggcac 
181 ggagcgggcc tgtaggagct gtccaggcca tctccaacc a tgqqagtgaq qcacccqccc 
241 tqctcccacc qqctcctqqc cgtctqcqcc ctcqtqtcct tqqcaaccqc tqcactcctq 
301 gggcacatcc tactccatqa tttcctqctq qttccccqaq aqctgaqtqq ctcctcccca 
361 gtcctggagq aqactcaccc aqctcaccaq cagggagcca qcaqaccaqq qccccqqqat 
421 gcccag gcac accccggccg tcccagagca gtgcccacac agtgcgacgt cccccccaac 
481 agccgcttcg attgcgcccc tgacaaggcc atcacccagg aacagtgcga ggcccgcggc 
541 tgctgctaca tccctgcaaa gcaggggctg cagggagccc agatggggca gccctggtgc 
601 ttcttcccac ccagctaccc cagctacaag ctggagaacc tgagctcctc tgaaatgggc 
661 tacacggcca ccctgacccg taccaccccc accttcttcc ccaaggacat cctgaccctg 
721 cggctggacg tgatgatgga gactgagaac cgcctccact tcacgatcaa agatccagct 
781 aacaggcgct acgaggtgcc cttggagacc ccgcgtgtcc acagccgggc accgtcccca 
841 ctctacagcg tggagttctc cgaggagccc ttcggggtga tcgtgcaccg gcagctggac 
901 ggccgcgtgc tgctgaacac gacggtggcg cccctgttct ttgcggacca gttccttcag 
961 ctgtccacct cgctgccctc gcagtatatc acaggcctcg ccgagcacct cagtcccctg 
1021 atgctcagca ccagctggac caggatcacc ctgtggaacc gggaccttgc gcccacgccc 
1081 ggtgcgaacc tctacgggtc tcaccctttc tacctggcgc tggaggacgg cgggtcggca 
1141 cacggggtgt tcctgctaaa cagcaatgcc atggatgtgg tcctgcagcc gagccctgcc 
1201 cttagctgga ggtcgacagg tgggatcctg gatgtctaca tcttcctggg cccagagccc 
1261 aagagcgtgg tgcagcagta cctggacgtt gtgggatacc cgttcatgcc gccatactgg 
1321 ggcctgggct tccacctgtg ccgctggggc tactcctcca ccgctatcac ccgccaggtg 
1381 gtggagaaca tgaccagggc ccacttcccc ctggacgtcc aatggaacga cctggactac 
14 41 atggactccc ggagggactt cacgttcaac aaggatggct tccgggactt cccggccatg 
1501 gtgcaggagc tgcaccaggg cggccggcgc tacatgatga tcgtggatcc tgccatcagc 
1561 agctcgggcc ctgccgggag ctacaggccc tacgacgagg gtctgcggag gggggttttc 
1621 atcaccaacg agaccggcca gccgctgatt gggaaggtat ggcccgggtc cactgccttc 
1681 cccgacttca ccaaccccac agccctggcc tggtgggagg acatggtggc tgagttccat 
1741 gaccaggtgc ccttcgacgg catgtggatt gacatgaacg agccttccaa cttcatcaga 
1801 ggctctgagg acggctgccc caacaatgag ctggagaacc caccctacgt gcctggggtg 
1861 gttgggggga ccctccaggc ggccaccatc tgtgcctcca gccaccagtt tctctccaca 
1921 cactacaacc tgcacaacct ctacggcctg accgaagcca tcgcctccca cagggcgctg 
1981 gtgaaggctc gggggacacg cccatttgtg atctcccgct cgacctttgc tggccacggc 
2041 cgatacgccg gccactggac gggggacgtg tggagctcct gggagcagct cgcctcctcc 
2101 gtgccagaaa tcctgcagtt taacctgctg ggggtgcctc tggtcggggc cgacgtctgc 
2161 ggcttcctgg gcaacacctc agaggagctg tgtgtgcgct ggacccagct gggggccttc 
2221 taccccttca tgcggaacca caacagcctg ctcagtctgc cccaggagcc gtacagcttc 
2281 agcgagccgg cccagcaggc catgaggaag gccctcaccc tgcgctacgc actcctcccc 
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cactgt tcca 


ccaggcccac 


y^.wywyyy>^>j 


aqaccotaac 


prfifTrrrTfr 1 

wwyy l» >— V-r l_ 


2401 


ttcctooaat 


tccccaaaaa 

%m V* V* V* V* ^ 


ctctagcacc 


taoactotoo 

^yyawwy«-yy 


accaccagct 


ectgeggggg 


2461 


cracfficcct ac 


teatcaeccc 


aatoctccaa 


yv»wyyyuuyy 


ccgaagtgac 


f- ri r*t t~ a fr - 


2521 


ceettacraca 


w« Lyy CI ^ y a 


cctacaoacci 

wv* i-ywuyciwy 


cftoccaatao 

y w y w w o w wy 


aaacccfctcrcr 

3 3 33 


LayCCUUCCd 


2581 


cccccacctci 


UO^V' L W W W *w y 


toaaccaocc 


atccacaaco 


aaaaacaata 


yy Lydtgccg 


264 1 


rcaacccccc 


fnnarflrrat 


caacotccac 


ctccoaacta 


aatacatcat* 


wU.wwwL<Jwciy 


2701 


acrccctoacc 


tcacaaccac 


aaaotcccac 

d W %Jk W w W w w W 


caacaoccca 


t QO c r* p t" rro r* 

*»yywv»v». k>yyv» 


^ytgyccucg 


2761 


accaacrcrcrt cr 




aaaaaaacta 

W* V4 ^4 t* \4 • w \J 


ttctaooaca 

V* w k« \J VH S» W 


atooaaacrao 

u w y y 0 y w y d y 


uuty tjady L y 


2821 


ct aaaaccracf 

w l» y y Q *H w y u y 


aaaccta.ca.c 


acaaatcatc 


t tcctaacca 


aoaataacar 

y y v«a a w d c& w u 


y d Luy (.yctaL 


2881 


oaactocrtac 


ort" fit" fTsrT*Ari 


taaacrciacfct 


V4 M W w V- v-# C* w 


t err* a a a a a rr "h 

wywuyuayy l. 


yauty L^tty 


294 1 


aac.at.ao cca 

3 3 3 3 J 




ocaaatcctc 


tccaacaato 

w w w a w* xj ^4 ^ 


tccctotcte 




3001 


tacaoccccn 


acaccaaaat 


cctaoacatc 

w w w y y c* *w k*» w w 


totatctccc 


tat taatrtrra 

y cyu uyy y 


3 3 3 LUL. 


3061 




y y y i* tayu^ 




ty u v, c* y ^ v u v> 


\.v^uayciyyy a 


yyv«.Lyy u luv_ 


3121 




ayflytuiy i-y 




oy^ty i- y tyu 


fTff nrr*t orrorr 


yttyuoiyuy 


3181 


tcacctggag 


ctgggcacta 


accattccaa 


gccgccgcat 


cgcttgtttc 


cacctcctgg 


3241 


gccggggctc 


tggcccccaa 


cgtgtctagg 


agagctttct 


ccctagatcg 


cactgtgggc 


3301 


cggggcctgg 


agggctgctc 


tgtgttaata 


agattgtaag 


gtttgccctc 


ctcacctgtt 


3361 


gccggcatgc 


gggtagtatt 


agccaccccc 


ctccatctgt 


tcccagcacc 


ggagaagggg 


3421 


gtgctcaggt 


ggaggtgtgg 


ggtatgcacc 


tgagctcctg 


cttcgcgcct 


gctgctctgc 


3481 


cccaacgcga 


ccgcttcccg 


gctgcccaga 


gggctggatg 


cctgccggtc 


cccgagcaag 


3541 


cctgggaact 


caggaaaatt 


cacaggactt 


gggagattct 


aaatcttaag 


tgcaattatt 


3601 


ttaataaaag 


gggcatttgg 


aatc 









SEQ ID NO 13: Amino acid sequence of human GAA and native 
signal peptide 



Ala 


His 


Pro 


Gly 


Arg 


Pro 


Arg 


Ala 


Val 


Pro 


Thr 


Gin 


Cys 


Asp 


Val 


Pro 


Pro 


Asn 


Ser 


Arg 


Phe 


Asp 


Cys 


Ala 


Pro 


Asp 


Lys 


Ala 


He 


Thr 


Gin 


Glu 


Gin 


Cys 


Glu 


Ala 


Arg 


Gly 


Cys 


Cys 


Tyr 


He 


Pro 


Ala 


Lys 


Gin 


Gly 


Leu 


Gin 


Gly 


Ala 


Gin 


Met 


Gly 


Gin 


Pro 


Trp 


Cys 


Phe 


Phe 


Pro 


Pro 


Ser 


Tyr 


Pro 


Ser 


Tyr 


Lys 


Leu 


Glu 


Asn 


Leu 


Ser 


Ser 


Ser 


Glu 


Met 


Gly 


Tyr 


Thr 


Ala 


Thr 


Leu 


Thr 


Arg 


Thr 


Thr 


Pro 


Thr 


Phe 


Phe 


Pro 


Lys 


Asp 


lie 


Leu 


Thr 


Leu 


Arg 


Leu 


Asp 


Val 


Met 


Met 


Glu 


Thr 


Glu 


Asn 


Arg 


Leu 


His 


Phe 


Thr 


He 


Lys 


Asp 


Pro 


Ala 


Asn 


Arg 


Arg 


Tyr 


Glu 


Val 


Pro 


Leu 


Glu 


Thr 


Pro 


Arg 


Val 


His 


Ser 


Arg 


Ala 


Pro 


Ser 


Pro 


Leu 


Tyr 


Ser 


Val 


Glu 


Phe 


Ser 


Glu 


Glu 


Pro 


Phe 


Gly 


Val 


He 


Val 


His 


Arg 


Gin 


Leu 


Asp 


Gly 


Arg 


Val 


Leu 


Leu 


Asn 


Thr 


Thr 


Val 


Ala 


Pro 


Leu 


Phe 


Phe 


Ala 


Asp 


Gin 


Phe 


Leu 


Gin 


Leu 


Ser 


Thr 


Ser 


Leu 


Pro 


Ser 


Gin 


Tyr 


He 


Thr 


Gly 


Leu 


Ala 
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Glu 


His 


Leu 


Ser 


Leu 


Tro 

X X O 


Asn 


Arcf 


Ser 


His 


Pro 


Phe 


Val 


Phe 


Leu 


Leu 


Pro 


Ala 


Leu 


Ser 


Phe 


Leu 


Glv 


Pro 


Val 

V O X 


Glv 


Tvr 
x y x 


Pro 

t X w 




A ro 


Trn 

X XL; 


Glv 


Asn 


Met 


Thr 

X 1 I X 


Ar cj 


n o ^ 


Tvr 
i y x 


Met 


A so 

n.o £J 


A ro 
n x y 


A so 


Phe 

rue 


Pro 

I- X w 


Tvr 
l yi. 


Met 


Met 

L 1C L, 


Tie 

X X c 


Ser 

OCX 


Tvr 
l yx 


A rrr 


P ro 


Asn 


Glu 


Thr 

X 1 IX 


Glv 


Ala 


Phe 


Prn 




Met 


Val 


Ala 


Glu 


A 


Mor 

L 1c L. 


Asm 
noli 


G 1 ii 


Pro 


Asn 


Asn 


Glu 


Glv 
oxy 


Thr 


Leu 


Gin 


Q o v- 


Thr 


Hi s 
n x a 


iyr 


Ala 


Ser 


His 


Arg 


lie 

X J- C 


Ser 

OCX 


A rn* 


Ser 

OCX 


Thr 

X 1 1 X 


Glv 


A so 

xTi O LV 


Val 

vox 


Gl 11 


Tie 

X X C 


T.en 

HC Li 


Gin 

O X 11 


Val 


Cys 


Gly 


Phe 


Thr 


Gin 


Leu 


Glv 

voxy 


XC U 


Ser 

OCX 


T.en 

XC v-i 


Pro 


nia 


Mer 


A to 
n x y 


T.vs 
xj y o 


iyr 


Thr 

X 1 1 X 


T.en 
ne u 


Phe 

t lie 


Pro 


Leu 


Phe 


Leu 


His 


Gin 


Leu 


Leu 


Ala 


Gly 


Lys 


Ala 


Asp 


Leu 


Gin 


Thr 


Pro 


Ala 


Ala 


Pro 


Thr 


Leu 


Pro 


Ala 


Tyr 


He 


He 


Pro 


Gin 


Gin 


Pro 


Met 



Pro 


Leu 


Met 


Leu 


Asp 


Leu 


Ala 


Pro 


Tvr 

x y x 


Leu 


Ala 


Leu 


noil 


^e r 

OCX 


Asn 
no 1 1 


Al a 
nx 0 


T TO 

x rp 


A rrr 

n x y 


Ser 


Thr 

X 1 IX 


Glu 


Pro 


Lys 


Ser 


Phe 

n its 


Met- 
tle L. 


P TO 
t X \J 




T w t 

i yr 


Cpr 
OCX. 


OCX 


Th r 

X (11 


Ala 


His 
n x o 


Phe 

L 1 1C 


Pro 


Cay 
OCX 


A TO 

nx y 


A rrr 
nx y 


A so 
n 0 


Ala 


Mer 

l lC L. 


Va 1 
v d i 


Gin 
0111 


V aX 


A so 


P ro 


Al a 

nla 


T\/t 

iyr 


Ben 

no 


Gl n 

O 1 u 


fZl XT 

oxy 




P TO 


T .01 1 

XjC U 


Tie 
lie 


Phe 
rue 


Thr 

X 1 1 X 


Asn 
no 1 1 


Pro 

X X 0 


Phe 


His 


Asp 


Gin 


t x O 


Cor 
OCX 


7\ en 
noil 


Pho 
x J lc 


T.o 1 1 
Xj C U 


Gin 

LJX Li 


Asn 
no 1 1 


Pro 

XXL/ 


A 1 a 
nla 


Ala 
nla 


Thr 

X 111 


Tie 

X X c 


no 11 


Ton 

XjC Li 


H-i s 
n 1 0 


Asn 
non 


Ala 
nla 


Toil 
XJC u 


Va 1 
vol 


Xj y O 


Th t 
1 Fix 


Pho 
C lie 


Al a 
nla 


L3i y 


T TO 

i rp 


^e r 

OCX 


^e r 

OCX 


T ro 


Phe 


Asn 


Leu 


Leu 


Tien 

XI C Ll 


Glv 

V3 x y 


Asn 


Thr 


Al a 
nla 


Phe 
rue 




Pro 

t X LV 


Gin 


Glu 


P ro 


Tvr 
x y 1 


Ala 

X CI 


T.en 

XjC Ll 


Thr 

X 1 1 X 


T.en 

XJC LI 


Hi s 
nlo 


will 


Al a 
old 


Hi s 
mo 


Glu 


Phe 


Pro 


Lys 


Trp 


Gly 


Glu 


Ala 


Glu 


Val 


Thr 


Gly 


Val 


Pro 


He 


Glu 


Arg 


Glu 


Pro 


Ala 


Pro 


Leu 


Asp 


Thr 


Leu 


Gin 


Gly 


Pro 


Ala 


Leu 


Ala 


Val 



Ser 


Thr 


Ser 


Tro 

J. X 0 


Thr 


Pro 


Glv 

oiy 


Ala 

nx 0 


Glu 


Asp 


Gly 


Glv 
w x y 


Met 


Asp 


Val 

vox 


Va 1 
vox 


Glv 

oxy 


Glv 

vjxy 


He 

X X c 


T.en 

XJC Ll 


Val 


Val 


Gin 


Gin 


Tvr 

x jr x 


Tro 


Glv 


Leu 


Ala 
n x a 


Tie 
x x e 


Thr 
x 1 11 


A Tn 
ni y 


Leu 


Asp 


Val 

vox 


Gin 


Phe 


Thr 


Phe 


Asn 


Glu 


Leu 


His 


Gin 

w XII 


He 

X X c 


Ser 

OCX 


OCX 


Cor 

OCX 


Leu 


Arg 


Arg 


Glv 


Glv 


Lys 


Val 


Trn 

X X w 


Thr 


Ala 


Leu 


Ala 


Va 1 

vol 


P TO 
L LU 


Pho 
x lie 


A <2Tl 

no lj 


He 


Arg 


Glv 


Ser 


Pro 

X X 0 


Tvr 
x yi 


Val 

vox 


Pro 

XXL/ 


Cys 


Ala 


Ser 

OCX 


Ser 

OCX 


T.en 

XjC Ll 


Tvr 
1 yr 


Gl v 
vj 1 y 


T.en 
XjC ll 


Ala 


Arg 


Glv 

vjxy 


Thr 


His 


Glv 

oxy 


A rcr 


Tvr 
x y 1 


Glu 


Gin 


Leu 


Ala 


Glv 
y 


Val 
vox 


Pro 


T.en 

XJC Ll 


Ser 


Glu 


Glu 


Leu 


Phe 


Met 


Arg 


Asn 


Ser 


Phe 


Ser 


Glu 


A TO 

ni y 


Tvr 
1 yr 


Ala 
ni a 


T.01 1 

XjC Ll 


Val 

vox 


Ala 


Glv 


Glu 

\J X u 


Asp 


Ser 


Ser 


Th *- 

1 nr 


Leu 


Leu 


He 


Thr 


Tyr 


Phe 


Pro 


Leu 


Ala 


Leu 


Gly 


Ser 


He 


His 


Ser 


Glu 


lie 


Asn 


Val 


His 


Gly 


Leu 


Thr 


Thr 


Ala 


Leu 


Thr 


Lys 



Thr 

X 1 1 X 


A yrf 
m y 


Tie 
lie 


Th t 
1 1 ir 


Asn 


T.en 

XjC Ll 


Tvr 
1 yi 


VJ 1 y 


OCX 


Ala 

nx 0 


Hi s 
n x 0 


Gl V 

01 y 


T.en 

XjC Ll 


Gl n 

Vj XII 


P TO 

riv 


OCX 


A SO 

no Lv 


Va 1 
vol 


T\/ T 

iyr 


T 1 0 
lie 


iyr 


Leu 


Asp 


Vdl 


gi v 
<oi y 


Dh 0 


rlis 


Leu 


ri I n 
oiu 


Va 1 

Vol 


Va 1 

Vol 


Gl 11 
Vol U 


T rn 

1 rp 


noil 


nop 


Leu 


Xj y O 


Asn 
no kJ 


rjii 
uiy 


Pho 
r lie 


Gl V 

La 1 y 


Gl \/ 
La 1 y 


nl y 


Arg 


r* 1 \/ 


Pro 


Ala 
nla 


pi ,, 
<o±y 


V ax 


Ph 0 

fne 


T 1 0 

lie 


Th v- 

1 nr 


x ro 


gi \r 


oer 


Th v 

1 nr 


T TO 

i rp 


T TO 

1 rp 


Ol U 


Hen 
nop 


ioiy 


iYie L. 


Trp 


T "J Q 

1 le 


Glu 

vj X Li 


A so 
n.o lv 


Glv 

Vj X y 




Glv 

vj x y 


Va 1 
vox 


Val 
vox 


Gl v 
0 1 y 


nlS 


PI n 
loin 


Dho 

fne 


Leu 


Th r 

1 nr 


PI 11 


Ala 

nia 


T 1 0 

ire 


Arg 


rlO 


fne 


Va 1 

v ax 


Al a 
mo 


uxy 


Hi s 
mo 


T m 

irp 


OCX 


Q 0 r 

OCX 


Va 1 
vox 


riu 


Va 1 

Vol 


P.1 w 
vj 1 y 


Ala 
nl 0 


Asp 


Cys 


Val 


Arg 


Tm 


ni 0 


7\ c n 

noil 


Q a r* 

oer 


Leu 


P TO 

x X \J 


Al a 

n x 0 


Gl n 
0111 


Gl n 

0111 


T.en 

XjC Ll 


P TO 

t 1 W 


His 
nl 0 


T.o 1 1 

XJC Li 


Th t 

X 111 


Va 1 
Vol 


Ala 
nla 


nr y 


Trp 


Tnr 


IT - I 

Val 


Asp 


Pro 


Val 


Leu 


Gin 


Gly 


Thr 


Trp 


Tyr 


Leu 


Pro 


Pro 


Pro 


Gly 


Gin 


Trp 


Val 


Leu 


Arg 


Ala 


Gly 


Thr 


Glu 


Ser 


Arg 


Gly 


Gly 


Glu 


Ala 
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Arg Gly Glu Leu Phe Trp Asp Asp Gly Glu Ser Leu Glu Val Leu Glu 
Arg Gly Ala Tyr Thr Gin Val lie Phe Leu Ala Arg Asn Asn Thr lie 
Val Asn Glu Leu Val Arg Val Thr Ser Glu Gly Ala Gly Leu Gin Leu 
Gin Lys Val Thr Val Leu Gly Val Ala Thr Ala Pro Gin Gin Val Leu 
Ser Asn Gly Val Pro Val Ser Asn Phe Thr Tyr Ser Pro Asp Thr Lys 
Val Leu Asp lie Cys Val Ser Leu Leu Met Gly Glu Gin Phe Leu Val 
Ser Trp Cys 

SEQ ID NO 14: Forward primer for GAA amplification 
5' : gatatctgcacaccccggccgtcccag 

SEQ ID NO 15: Reverse primer for GAA amplification 
5' : gtcaaagagcagtcgaccacaatcctatag 



